Corpus: cat_news_2022_30K


5.2.18 Words nearly always together in sentences

Strong sentence co-occurrences with a low probability of being separated

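The quotient below is calculated as freq(word1)*freq(word2)/together_freq^2. A value of 1.00 means the two words never occur in a sentence without each other; larger values mean they are separated more often.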
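The calculation can be sketched in a few lines of Python. This is only an illustration, not the code that produced this report: the function name `separation_quotient` is hypothetical, and it assumes the numerator is the product of the two individual word frequencies, which is what the values in the table are consistent with. The sample rows are copied from the table.

```python
def separation_quotient(freq_word1: int, freq_word2: int, together_freq: int) -> float:
    """Return freq(word1) * freq(word2) / together_freq^2.

    Hypothetical helper for illustration; the name is not taken from the report.
    """
    if together_freq <= 0:
        raise ValueError("together_freq must be positive")
    return freq_word1 * freq_word2 / together_freq ** 2


# A few rows copied from the table: (word 1, word 2, freq 1, freq 2, freq together)
rows = [
    ("Estats", "Units", 112, 112, 110),
    ("Regne", "Unit", 46, 44, 43),
    ("Le", "Pen", 32, 24, 24),
    ("Buenos", "Aires", 9, 9, 9),
]

for word1, word2, f1, f2, together in rows:
    quotient = separation_quotient(f1, f2, together)
    print(f"{word1} {word2}: {quotient:.2f}")
```

Rounded to two decimals, these rows give 1.04, 1.09, 1.33 and 1.00, matching the Quotient column.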

Word 1      Word 2      Frequency of word 1  Frequency of word 2  Frequency together  Quotient
Estats      Units       112                  112                  110                 1.04
Units       Estats      112                  112                  110                 1.04
Regne       Unit        46                   44                   43                  1.09
Unit        Regne       44                   46                   43                  1.09
Le          Pen         32                   24                   24                  1.33
Ambient     Medi        28                   28                   27                  1.08
Medi        Ambient     28                   28                   27                  1.08
Pen         Le          24                   32                   24                  1.33
der         Leyen       19                   16                   16                  1.19
Leyen       der         16                   19                   16                  1.19
Donetsk     Lugansk     11                   11                   10                  1.21
Lugansk     Donetsk     11                   11                   10                  1.21
Next        Generation  11                   10                   10                  1.10
Generation  Next        10                   11                   10                  1.10
canceller   Scholz      10                   9                    8                   1.41
von         Ursula      10                   8                    8                   1.25
Aires       Buenos      9                    9                    9                   1.00
Buenos      Aires       9                    9                    9                   1.00
Lab         Citizen     9                    7                    7                   1.29
Saudita     l’Aràbia    9                    7                    7                   1.29
637 msec needed at 2023-02-24 21:02